Overview

Dataset statistics

Number of variables15
Number of observations113167
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory20.5 MiB
Average record size in memory189.9 B

Variable types

Numeric14
Categorical1

Alerts

acq_date has a high cardinality: 4037 distinct values High cardinality
latitude is highly correlated with longitude and 1 other fieldsHigh correlation
longitude is highly correlated with latitude and 1 other fieldsHigh correlation
brightness is highly correlated with confidenceHigh correlation
confidence is highly correlated with brightnessHigh correlation
elevation is highly correlated with latitude and 1 other fieldsHigh correlation
max_temp is highly correlated with min_tempHigh correlation
min_temp is highly correlated with max_tempHigh correlation
latitude is highly correlated with longitudeHigh correlation
longitude is highly correlated with latitudeHigh correlation
brightness is highly correlated with confidenceHigh correlation
confidence is highly correlated with brightnessHigh correlation
max_temp is highly correlated with min_tempHigh correlation
min_temp is highly correlated with max_temp and 1 other fieldsHigh correlation
pressure is highly correlated with min_tempHigh correlation
brightness is highly correlated with confidenceHigh correlation
confidence is highly correlated with brightnessHigh correlation
max_temp is highly correlated with min_tempHigh correlation
min_temp is highly correlated with max_tempHigh correlation
latitude is highly correlated with longitude and 1 other fieldsHigh correlation
longitude is highly correlated with latitude and 2 other fieldsHigh correlation
brightness is highly correlated with confidenceHigh correlation
confidence is highly correlated with brightnessHigh correlation
elevation is highly correlated with latitude and 2 other fieldsHigh correlation
solar_radiation is highly correlated with humidityHigh correlation
max_temp is highly correlated with min_temp and 1 other fieldsHigh correlation
min_temp is highly correlated with max_tempHigh correlation
pressure is highly correlated with max_tempHigh correlation
humidity is highly correlated with longitude and 2 other fieldsHigh correlation
df_index is uniformly distributed Uniform
df_index has unique values Unique
brightness has 55828 (49.3%) zeros Zeros
confidence has 56730 (50.1%) zeros Zeros
elevation has 10312 (9.1%) zeros Zeros
precipitation has 83660 (73.9%) zeros Zeros

Reproduction

Analysis started2022-02-19 01:07:44.417211
Analysis finished2022-02-19 01:08:49.362313
Duration1 minute and 4.95 seconds
Software versionpandas-profiling v3.1.1
Download configurationconfig.json

Variables

df_index
Real number (ℝ≥0)

UNIFORM
UNIQUE

Distinct113167
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean59981.47299
Minimum0
Maximum119999
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size884.2 KiB

Quantile statistics

Minimum0
5-th percentile6013.3
Q130006.5
median59918
Q390017.5
95-th percentile113999.7
Maximum119999
Range119999
Interquartile range (IQR)60011

Descriptive statistics

Standard deviation34634.73488
Coefficient of variation (CV)0.5774238803
Kurtosis-1.200955517
Mean59981.47299
Median Absolute Deviation (MAD)30006
Skewness0.00165729683
Sum6787923354
Variance1199564860
MonotonicityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
01
 
< 0.1%
800181
 
< 0.1%
800291
 
< 0.1%
800281
 
< 0.1%
800271
 
< 0.1%
800261
 
< 0.1%
800251
 
< 0.1%
800241
 
< 0.1%
800231
 
< 0.1%
800221
 
< 0.1%
Other values (113157)113157
> 99.9%
ValueCountFrequency (%)
01
< 0.1%
11
< 0.1%
21
< 0.1%
31
< 0.1%
61
< 0.1%
71
< 0.1%
81
< 0.1%
91
< 0.1%
101
< 0.1%
111
< 0.1%
ValueCountFrequency (%)
1199991
< 0.1%
1199981
< 0.1%
1199971
< 0.1%
1199961
< 0.1%
1199951
< 0.1%
1199941
< 0.1%
1199931
< 0.1%
1199921
< 0.1%
1199911
< 0.1%
1199901
< 0.1%

latitude
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct49415
Distinct (%)43.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.36434835
Minimum19.02689934
Maximum70.30950165
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size884.2 KiB

Quantile statistics

Minimum19.02689934
5-th percentile28.85108967
Q132.11719894
median35.65330124
Q341.3871994
95-th percentile48.44680023
Maximum70.30950165
Range51.28260231
Interquartile range (IQR)9.270000458

Descriptive statistics

Standard deviation7.753203256
Coefficient of variation (CV)0.2075027024
Kurtosis3.33444905
Mean37.36434835
Median Absolute Deviation (MAD)4.192001343
Skewness1.407421965
Sum4228411.21
Variance60.11216073
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
41.633201618
 
< 0.1%
42.6824989316
 
< 0.1%
41.6357994114
 
< 0.1%
41.6319999714
 
< 0.1%
19.4069004114
 
< 0.1%
41.4626998914
 
< 0.1%
33.4137992912
 
< 0.1%
19.4062004112
 
< 0.1%
33.4187011712
 
< 0.1%
33.4177017212
 
< 0.1%
Other values (49405)113029
99.9%
ValueCountFrequency (%)
19.026899341
< 0.1%
19.279699332
< 0.1%
19.329299932
< 0.1%
19.329900742
< 0.1%
19.331600192
< 0.1%
19.333099372
< 0.1%
19.333700182
< 0.1%
19.333900452
< 0.1%
19.335100172
< 0.1%
19.335399632
< 0.1%
ValueCountFrequency (%)
70.309501652
< 0.1%
70.216201782
< 0.1%
70.147102362
< 0.1%
69.429603581
< 0.1%
69.053298951
< 0.1%
69.050796511
< 0.1%
69.050201421
< 0.1%
68.985603332
< 0.1%
68.477699281
< 0.1%
68.464599611
< 0.1%

longitude
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct52912
Distinct (%)46.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-99.70368883
Minimum-166.0917053
Maximum-67.0042038
Zeros0
Zeros (%)0.0%
Negative113167
Negative (%)100.0%
Memory size884.2 KiB

Quantile statistics

Minimum-166.0917053
5-th percentile-124.0730972
Q1-111.8425522
median-94.92340088
Q3-86.25489807
95-th percentile-80.42549896
Maximum-67.0042038
Range99.08750153
Interquartile range (IQR)25.58765411

Descriptive statistics

Standard deviation17.94213104
Coefficient of variation (CV)-0.1799545358
Kurtosis1.541401073
Mean-99.70368883
Median Absolute Deviation (MAD)10.36230469
Skewness-1.28612591
Sum-11283167.35
Variance321.9200662
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-87.1340026912
 
< 0.1%
-88.0149993912
 
< 0.1%
-111.59410111
 
< 0.1%
-111.588897710
 
< 0.1%
-87.133903510
 
< 0.1%
-88.0102996810
 
< 0.1%
-88.0130004910
 
< 0.1%
-155.281295810
 
< 0.1%
-81.6756973310
 
< 0.1%
-81.676002510
 
< 0.1%
Other values (52902)113062
99.9%
ValueCountFrequency (%)
-166.09170532
< 0.1%
-164.65499881
< 0.1%
-164.4591982
< 0.1%
-164.4053042
< 0.1%
-164.37269592
< 0.1%
-164.36529542
< 0.1%
-164.35969542
< 0.1%
-164.27520752
< 0.1%
-164.25799562
< 0.1%
-164.12640382
< 0.1%
ValueCountFrequency (%)
-67.00420382
< 0.1%
-67.520599372
< 0.1%
-67.700599672
< 0.1%
-67.891998292
< 0.1%
-68.022598272
< 0.1%
-68.337699892
< 0.1%
-68.580101012
< 0.1%
-68.600502012
< 0.1%
-68.610900882
< 0.1%
-68.916801452
< 0.1%

acq_date
Categorical

HIGH CARDINALITY

Distinct4037
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Memory size8.4 MiB
2/28/2015 12:00:00 AM
 
195
6/25/2015 12:00:00 AM
 
166
2/29/2020 12:00:00 AM
 
134
9/7/2020 12:00:00 AM
 
125
9/8/2020 12:00:00 AM
 
125
Other values (4032)
112422 

Length

Max length22
Median length21
Mean length20.87833909
Min length20

Characters and Unicode

Total characters2362739
Distinct characters15
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique15 ?
Unique (%)< 0.1%

Sample

1st row4/5/2021 12:00:00 AM
2nd row3/5/2021 12:00:00 AM
3rd row4/22/2013 12:00:00 AM
4th row3/22/2013 12:00:00 AM
5th row4/8/2020 12:00:00 AM

Common Values

ValueCountFrequency (%)
2/28/2015 12:00:00 AM195
 
0.2%
6/25/2015 12:00:00 AM166
 
0.1%
2/29/2020 12:00:00 AM134
 
0.1%
9/7/2020 12:00:00 AM125
 
0.1%
9/8/2020 12:00:00 AM125
 
0.1%
4/1/2021 12:00:00 AM125
 
0.1%
2/28/2021 12:00:00 AM123
 
0.1%
6/23/2015 12:00:00 AM119
 
0.1%
2/28/2014 12:00:00 AM118
 
0.1%
9/9/2020 12:00:00 AM118
 
0.1%
Other values (4027)111819
98.8%

Length

Histogram of lengths of the category
ValueCountFrequency (%)
12:00:00113167
33.3%
am113167
33.3%
2/28/2015195
 
0.1%
6/25/2015166
 
< 0.1%
2/29/2020134
 
< 0.1%
9/7/2020125
 
< 0.1%
9/8/2020125
 
< 0.1%
4/1/2021125
 
< 0.1%
2/28/2021123
 
< 0.1%
6/23/2015119
 
< 0.1%
Other values (4029)112055
33.0%

Most occurring characters

ValueCountFrequency (%)
0598669
25.3%
2321647
13.6%
1308698
13.1%
/226334
 
9.6%
226334
 
9.6%
:226334
 
9.6%
A113167
 
4.8%
M113167
 
4.8%
338845
 
1.6%
734409
 
1.5%
Other values (5)155135
 
6.6%

Most occurring categories

ValueCountFrequency (%)
Decimal Number1457403
61.7%
Other Punctuation452668
 
19.2%
Space Separator226334
 
9.6%
Uppercase Letter226334
 
9.6%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0598669
41.1%
2321647
22.1%
1308698
21.2%
338845
 
2.7%
734409
 
2.4%
833675
 
2.3%
932263
 
2.2%
629927
 
2.1%
429874
 
2.0%
529396
 
2.0%
Other Punctuation
ValueCountFrequency (%)
/226334
50.0%
:226334
50.0%
Uppercase Letter
ValueCountFrequency (%)
A113167
50.0%
M113167
50.0%
Space Separator
ValueCountFrequency (%)
226334
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common2136405
90.4%
Latin226334
 
9.6%

Most frequent character per script

Common
ValueCountFrequency (%)
0598669
28.0%
2321647
15.1%
1308698
14.4%
/226334
 
10.6%
226334
 
10.6%
:226334
 
10.6%
338845
 
1.8%
734409
 
1.6%
833675
 
1.6%
932263
 
1.5%
Other values (3)89197
 
4.2%
Latin
ValueCountFrequency (%)
A113167
50.0%
M113167
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII2362739
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0598669
25.3%
2321647
13.6%
1308698
13.1%
/226334
 
9.6%
226334
 
9.6%
:226334
 
9.6%
A113167
 
4.8%
M113167
 
4.8%
338845
 
1.6%
734409
 
1.5%
Other values (5)155135
 
6.6%

brightness
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct4048
Distinct (%)3.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean155.7244602
Minimum0
Maximum454.4499969
Zeros55828
Zeros (%)49.3%
Negative0
Negative (%)0.0%
Memory size884.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median290.0500031
Q3305.4000092
95-th percentile321.4499969
Maximum454.4499969
Range454.4499969
Interquartile range (IQR)305.4000092

Descriptive statistics

Standard deviation153.9131969
Coefficient of variation (CV)0.9883687938
Kurtosis-1.983785855
Mean155.7244602
Median Absolute Deviation (MAD)56.6499939
Skewness-0.01618842497
Sum17622869.99
Variance23689.27219
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
055828
49.3%
303.75131
 
0.1%
304.25131
 
0.1%
301.5129
 
0.1%
305.75129
 
0.1%
303.5128
 
0.1%
297.75126
 
0.1%
302.8000031126
 
0.1%
300.75125
 
0.1%
300.5125
 
0.1%
Other values (4038)56189
49.7%
ValueCountFrequency (%)
055828
49.3%
282.80000311
 
< 0.1%
282.94999691
 
< 0.1%
283.85000611
 
< 0.1%
283.94999691
 
< 0.1%
284.05000311
 
< 0.1%
284.10000612
 
< 0.1%
284.19999691
 
< 0.1%
284.251
 
< 0.1%
284.35000611
 
< 0.1%
ValueCountFrequency (%)
454.44999691
< 0.1%
453.80000311
< 0.1%
452.751
< 0.1%
452.69999691
< 0.1%
451.251
< 0.1%
450.90000921
< 0.1%
450.44999691
< 0.1%
450.19999692
< 0.1%
449.30000311
< 0.1%
445.55000311
< 0.1%

confidence
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct101
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.28904186
Minimum0
Maximum100
Zeros56730
Zeros (%)50.1%
Negative0
Negative (%)0.0%
Memory size884.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q365
95-th percentile93
Maximum100
Range100
Interquartile range (IQR)65

Descriptive statistics

Standard deviation35.34490287
Coefficient of variation (CV)1.094640808
Kurtosis-1.408391289
Mean32.28904186
Median Absolute Deviation (MAD)0
Skewness0.4297878162
Sum3654054
Variance1249.262159
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
056730
50.1%
1003553
 
3.1%
631224
 
1.1%
621206
 
1.1%
611193
 
1.1%
651156
 
1.0%
681151
 
1.0%
641140
 
1.0%
661139
 
1.0%
591116
 
1.0%
Other values (91)43559
38.5%
ValueCountFrequency (%)
056730
50.1%
12
 
< 0.1%
25
 
< 0.1%
311
 
< 0.1%
414
 
< 0.1%
514
 
< 0.1%
620
 
< 0.1%
730
 
< 0.1%
837
 
< 0.1%
943
 
< 0.1%
ValueCountFrequency (%)
1003553
3.1%
99256
 
0.2%
98277
 
0.2%
97292
 
0.3%
96312
 
0.3%
95333
 
0.3%
94456
 
0.4%
93378
 
0.3%
92379
 
0.3%
91439
 
0.4%

elevation
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
ZEROS

Distinct2988
Distinct (%)2.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean461.6968551
Minimum-67
Maximum3588
Zeros10312
Zeros (%)9.1%
Negative124
Negative (%)0.1%
Memory size884.2 KiB

Quantile statistics

Minimum-67
5-th percentile0
Q153
median185
Q3530
95-th percentile1965
Maximum3588
Range3655
Interquartile range (IQR)477

Descriptive statistics

Standard deviation640.3855877
Coefficient of variation (CV)1.387026099
Kurtosis2.795833258
Mean461.6968551
Median Absolute Deviation (MAD)166
Skewness1.867905143
Sum52248848
Variance410093.7009
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
010312
 
9.1%
11701
 
0.6%
12606
 
0.5%
5569
 
0.5%
6555
 
0.5%
10539
 
0.5%
7506
 
0.4%
9481
 
0.4%
4425
 
0.4%
73421
 
0.4%
Other values (2978)98052
86.6%
ValueCountFrequency (%)
-672
 
< 0.1%
-662
 
< 0.1%
-592
 
< 0.1%
-542
 
< 0.1%
-532
 
< 0.1%
-522
 
< 0.1%
-492
 
< 0.1%
-442
 
< 0.1%
-422
 
< 0.1%
-416
< 0.1%
ValueCountFrequency (%)
35882
< 0.1%
35452
< 0.1%
35192
< 0.1%
34762
< 0.1%
34672
< 0.1%
34542
< 0.1%
34132
< 0.1%
34042
< 0.1%
33892
< 0.1%
33774
< 0.1%

precipitation
Real number (ℝ≥0)

ZEROS

Distinct3556
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.561878816
Minimum0
Maximum268.6499939
Zeros83660
Zeros (%)73.9%
Negative0
Negative (%)0.0%
Memory size884.2 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.009999999776
95-th percentile10.10999966
Maximum268.6499939
Range268.6499939
Interquartile range (IQR)0.009999999776

Descriptive statistics

Standard deviation6.155908974
Coefficient of variation (CV)3.941348658
Kurtosis120.5330578
Mean1.561878816
Median Absolute Deviation (MAD)0
Skewness8.191279937
Sum176753.1399
Variance37.8952153
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
083660
73.9%
0.0099999997763556
 
3.1%
0.019999999551512
 
1.3%
0.02999999933919
 
0.8%
0.03999999911700
 
0.6%
0.05000000075536
 
0.5%
0.05999999866467
 
0.4%
0.0700000003360
 
0.3%
0.07999999821317
 
0.3%
0.09000000358267
 
0.2%
Other values (3546)20873
 
18.4%
ValueCountFrequency (%)
083660
73.9%
0.0099999997763556
 
3.1%
0.019999999551512
 
1.3%
0.02999999933919
 
0.8%
0.03999999911700
 
0.6%
0.05000000075536
 
0.5%
0.05999999866467
 
0.4%
0.0700000003360
 
0.3%
0.07999999821317
 
0.3%
0.09000000358267
 
0.2%
ValueCountFrequency (%)
268.64999391
< 0.1%
182.05999761
< 0.1%
171.78999331
< 0.1%
161.92999271
< 0.1%
153.77000431
< 0.1%
145.83999631
< 0.1%
140.22999571
< 0.1%
139.05000311
< 0.1%
132.02000431
< 0.1%
130.91000371
< 0.1%

solar_radiation
Real number (ℝ≥0)

HIGH CORRELATION

Distinct38393
Distinct (%)33.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean376.8661961
Minimum0
Maximum1138.5
Zeros1
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size884.2 KiB

Quantile statistics

Minimum0
5-th percentile176.5329971
Q1308.0749969
median389.2200012
Q3456.6000061
95-th percentile530.9000244
Maximum1138.5
Range1138.5
Interquartile range (IQR)148.5250092

Descriptive statistics

Standard deviation108.1562769
Coefficient of variation (CV)0.2869885335
Kurtosis0.1462103876
Mean376.8661961
Median Absolute Deviation (MAD)73.41000366
Skewness-0.4423704553
Sum42648816.81
Variance11697.78024
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
496.899993925
 
< 0.1%
394.100006125
 
< 0.1%
394.899993923
 
< 0.1%
328.899993923
 
< 0.1%
444.799987823
 
< 0.1%
426.399993923
 
< 0.1%
355.600006123
 
< 0.1%
337.200012222
 
< 0.1%
394.399993922
 
< 0.1%
448.100006121
 
< 0.1%
Other values (38383)112937
99.8%
ValueCountFrequency (%)
01
 
< 0.1%
11
 
< 0.1%
34
< 0.1%
51
 
< 0.1%
6.6999998091
 
< 0.1%
81
 
< 0.1%
8.51
 
< 0.1%
11.359999661
 
< 0.1%
11.409999851
 
< 0.1%
11.960000041
 
< 0.1%
ValueCountFrequency (%)
1138.51
< 0.1%
1130.6999511
< 0.1%
1129.9000241
< 0.1%
1129.0999761
< 0.1%
1125.3000492
< 0.1%
1080.9000241
< 0.1%
9321
< 0.1%
907.29998781
< 0.1%
906.51
< 0.1%
906.40002441
< 0.1%

max_temp
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct123
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean75.62136489
Minimum-7
Maximum120
Zeros1
Zeros (%)< 0.1%
Negative4
Negative (%)< 0.1%
Memory size884.2 KiB

Quantile statistics

Minimum-7
5-th percentile47
Q166
median77
Q388
95-th percentile97
Maximum120
Range127
Interquartile range (IQR)22

Descriptive statistics

Standard deviation15.72790035
Coefficient of variation (CV)0.2079822332
Kurtosis0.3144021522
Mean75.62136489
Median Absolute Deviation (MAD)11
Skewness-0.6250560348
Sum8557843
Variance247.3668493
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
843483
 
3.1%
823326
 
2.9%
903270
 
2.9%
813114
 
2.8%
863093
 
2.7%
882967
 
2.6%
792958
 
2.6%
912957
 
2.6%
722911
 
2.6%
702866
 
2.5%
Other values (113)82222
72.7%
ValueCountFrequency (%)
-71
 
< 0.1%
-22
 
< 0.1%
-11
 
< 0.1%
01
 
< 0.1%
15
< 0.1%
23
< 0.1%
31
 
< 0.1%
42
 
< 0.1%
52
 
< 0.1%
61
 
< 0.1%
ValueCountFrequency (%)
1201
 
< 0.1%
1183
 
< 0.1%
1176
 
< 0.1%
1164
 
< 0.1%
11519
 
< 0.1%
1148
 
< 0.1%
11349
< 0.1%
1129
 
< 0.1%
11165
0.1%
11047
< 0.1%

min_temp
Real number (ℝ)

HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION

Distinct129
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean52.71154135
Minimum-23
Maximum252
Zeros13
Zeros (%)< 0.1%
Negative127
Negative (%)0.1%
Memory size884.2 KiB

Quantile statistics

Minimum-23
5-th percentile28
Q142
median54
Q364
95-th percentile75
Maximum252
Range275
Interquartile range (IQR)22

Descriptive statistics

Standard deviation15.2262542
Coefficient of variation (CV)0.288859969
Kurtosis2.6648999
Mean52.71154135
Median Absolute Deviation (MAD)11
Skewness-0.06279060746
Sum5965207
Variance231.8388169
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
543712
 
3.3%
553505
 
3.1%
573400
 
3.0%
483298
 
2.9%
593246
 
2.9%
523227
 
2.9%
613221
 
2.8%
633200
 
2.8%
463195
 
2.8%
503117
 
2.8%
Other values (119)80046
70.7%
ValueCountFrequency (%)
-231
 
< 0.1%
-212
< 0.1%
-201
 
< 0.1%
-183
< 0.1%
-173
< 0.1%
-162
< 0.1%
-152
< 0.1%
-144
< 0.1%
-133
< 0.1%
-123
< 0.1%
ValueCountFrequency (%)
2521
 
< 0.1%
2442
 
< 0.1%
2435
< 0.1%
2411
 
< 0.1%
2322
 
< 0.1%
2211
 
< 0.1%
2031
 
< 0.1%
1991
 
< 0.1%
1982
 
< 0.1%
1893
< 0.1%

pressure
Real number (ℝ≥0)

HIGH CORRELATION
HIGH CORRELATION

Distinct45429
Distinct (%)40.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1307.859446
Minimum0
Maximum3875.02002
Zeros13
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size884.2 KiB

Quantile statistics

Minimum0
5-th percentile1002.599976
Q11015.599976
median1024
Q31429.585022
95-th percentile2613.66792
Maximum3875.02002
Range3875.02002
Interquartile range (IQR)413.9850464

Descriptive statistics

Standard deviation549.3876093
Coefficient of variation (CV)0.420066247
Kurtosis1.652808829
Mean1307.859446
Median Absolute Deviation (MAD)13
Skewness1.563571233
Sum148006530
Variance301826.7452
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1017566
 
0.5%
1016565
 
0.5%
1019558
 
0.5%
1015512
 
0.5%
1021509
 
0.4%
1018503
 
0.4%
1020485
 
0.4%
1023480
 
0.4%
1017.5470
 
0.4%
1015.5461
 
0.4%
Other values (45419)108058
95.5%
ValueCountFrequency (%)
013
< 0.1%
58.430000311
 
< 0.1%
63.770000461
 
< 0.1%
75.150001531
 
< 0.1%
75.839996341
 
< 0.1%
83.040000921
 
< 0.1%
83.760002141
 
< 0.1%
88.290000921
 
< 0.1%
88.319999691
 
< 0.1%
89.330001831
 
< 0.1%
ValueCountFrequency (%)
3875.020021
< 0.1%
3772.6201171
< 0.1%
3704.0700681
< 0.1%
3670.5700681
< 0.1%
3667.8100591
< 0.1%
3662.3601071
< 0.1%
3643.0700681
< 0.1%
3634.8100591
< 0.1%
3631.9299321
< 0.1%
3630.5800781
< 0.1%

humidity
Real number (ℝ≥0)

HIGH CORRELATION

Distinct8612
Distinct (%)7.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean62.44548765
Minimum0
Maximum100
Zeros43
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size884.2 KiB

Quantile statistics

Minimum0
5-th percentile30.40999985
Q151.56999969
median64.73000336
Q375.02999878
95-th percentile87.52999878
Maximum100
Range100
Interquartile range (IQR)23.45999908

Descriptive statistics

Standard deviation17.27081706
Coefficient of variation (CV)0.2765743004
Kurtosis-0.06259896518
Mean62.44548765
Median Absolute Deviation (MAD)11.48999786
Skewness-0.5279682642
Sum7066768.501
Variance298.2811219
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
65533
 
0.5%
59473
 
0.4%
68470
 
0.4%
62447
 
0.4%
57389
 
0.3%
58374
 
0.3%
64365
 
0.3%
63364
 
0.3%
70362
 
0.3%
61362
 
0.3%
Other values (8602)109028
96.3%
ValueCountFrequency (%)
043
< 0.1%
4.9899997711
 
< 0.1%
5.8400001531
 
< 0.1%
61
 
< 0.1%
6.1300001141
 
< 0.1%
6.1500000951
 
< 0.1%
6.219999791
 
< 0.1%
6.4400000571
 
< 0.1%
6.6300001141
 
< 0.1%
6.6799998282
 
< 0.1%
ValueCountFrequency (%)
10017
< 0.1%
99.980003362
 
< 0.1%
99.959999081
 
< 0.1%
99.930000311
 
< 0.1%
99.919998172
 
< 0.1%
99.910003662
 
< 0.1%
99.900001531
 
< 0.1%
99.889999391
 
< 0.1%
99.879997251
 
< 0.1%
99.870002751
 
< 0.1%

wind_speed
Real number (ℝ≥0)

Distinct2275
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean12.42003687
Minimum0
Maximum92.19999695
Zeros16
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size884.2 KiB

Quantile statistics

Minimum0
5-th percentile5.699999809
Q18.699999809
median11.39999962
Q315.10000038
95-th percentile22.20000076
Maximum92.19999695
Range92.19999695
Interquartile range (IQR)6.400000572

Descriptive statistics

Standard deviation5.271979985
Coefficient of variation (CV)0.424473779
Kurtosis3.239147241
Mean12.42003687
Median Absolute Deviation (MAD)3.199999809
Skewness1.182350619
Sum1405538.312
Variance27.79377296
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10.300000192101
 
1.9%
9.1999998092078
 
1.8%
11.399999622038
 
1.8%
8.1000003811753
 
1.5%
12.800000191724
 
1.5%
13.899999621637
 
1.4%
6.9000000951405
 
1.2%
151345
 
1.2%
16.100000381264
 
1.1%
17.200000761046
 
0.9%
Other values (2265)96776
85.5%
ValueCountFrequency (%)
016
< 0.1%
0.10000000152
 
< 0.1%
0.30000001193
 
< 0.1%
0.4000000063
 
< 0.1%
0.60000002382
 
< 0.1%
0.69999998811
 
< 0.1%
0.80000001194
 
< 0.1%
0.91962933546
 
< 0.1%
0.96933907271
 
< 0.1%
12
 
< 0.1%
ValueCountFrequency (%)
92.199996951
< 0.1%
86.800003051
< 0.1%
82.400001531
< 0.1%
57.51
< 0.1%
54.599998471
< 0.1%
54.099998471
< 0.1%
52.900001531
< 0.1%
52.700000761
< 0.1%
51.700000761
< 0.1%
51.400001531
< 0.1%

winddir
Real number (ℝ≥0)

Distinct19864
Distinct (%)17.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean185.9792182
Minimum0
Maximum360
Zeros32
Zeros (%)< 0.1%
Negative0
Negative (%)0.0%
Memory size884.2 KiB

Quantile statistics

Minimum0
5-th percentile60.7028183
Q1129.1000061
median183.6999969
Q3241.5617523
95-th percentile317.0498352
Maximum360
Range360
Interquartile range (IQR)112.4617462

Descriptive statistics

Standard deviation77.17927915
Coefficient of variation (CV)0.4149887278
Kurtosis-0.7150800723
Mean185.9792182
Median Absolute Deviation (MAD)56.19999695
Skewness0.08179573841
Sum21046710.18
Variance5956.64113
MonotonicityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
170101
 
0.1%
180100
 
0.1%
19097
 
0.1%
22090
 
0.1%
212.796234185
 
0.1%
257.992919984
 
0.1%
15084
 
0.1%
153.584
 
0.1%
22583
 
0.1%
157.579
 
0.1%
Other values (19854)112280
99.2%
ValueCountFrequency (%)
032
< 0.1%
0.47838619352
 
< 0.1%
0.51
 
< 0.1%
0.81249225141
 
< 0.1%
1.4960803991
 
< 0.1%
21
 
< 0.1%
2.5554404261
 
< 0.1%
31
 
< 0.1%
3.55
 
< 0.1%
4.4000000951
 
< 0.1%
ValueCountFrequency (%)
3605
< 0.1%
359.97332761
 
< 0.1%
359.96844481
 
< 0.1%
359.9253541
 
< 0.1%
359.92199714
< 0.1%
359.87023936
< 0.1%
359.86416632
 
< 0.1%
359.83920291
 
< 0.1%
359.79211431
 
< 0.1%
359.69235231
 
< 0.1%

Interactions

Correlations

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

df_indexlatitudelongitudeacq_datebrightnessconfidenceelevationprecipitationsolar_radiationmax_tempmin_temppressurehumiditywind_speedwinddir
0034.327702-82.2900014/5/2021 12:00:00 AM302.0000005100.00229.69999781481020.00000047.2400029.0210.441437
1134.327702-82.2900013/5/2021 12:00:00 AM0.000000000.00265.60000662381018.29998840.5299997.570.500000
2231.697500-92.6066974/22/2013 12:00:00 AM303.89999453460.01536.09002781571056.56005966.4800038.856.099998
3331.697500-92.6066973/22/2013 12:00:00 AM0.0000000460.01397.00000078561249.18994175.37000310.3119.800003
4641.711800-85.8247994/8/2020 12:00:00 AM300.500000522353.14369.29998873501112.30004975.36000116.9338.766754
5741.711800-85.8247993/8/2020 12:00:00 AM0.00000002350.00421.50000061371025.50000041.88000120.1202.800003
6832.430302-82.6799013/16/2017 12:00:00 AM295.00000039800.00486.11999557301014.28002948.0000009.8290.692566
7932.430302-82.6799012/16/2017 12:00:00 AM0.0000000800.00382.8500066638751.27002047.8300029.2301.299988
81030.625000-88.91780111/20/2020 12:00:00 AM301.05000363310.00323.14001577631279.06005977.79000111.879.186386
91130.625000-88.91780110/20/2020 12:00:00 AM0.0000000310.00349.60000686662116.09008880.97000111.074.199997

Last rows

df_indexlatitudelongitudeacq_datebrightnessconfidenceelevationprecipitationsolar_radiationmax_tempmin_temppressurehumiditywind_speedwinddir
11315711999027.770700-80.9228974/19/2018 12:00:00 AM319.34999182230.00564.65002488681026.05004965.54000113.200000194.000000
11315811999127.770700-80.9228973/19/2018 12:00:00 AM0.0000000230.08482.36999587631100.28002969.94000210.100000235.100006
11315911999265.307297-154.7306987/1/2015 12:00:00 AM309.550003851850.00277.80999870551019.07000773.0000009.200000253.000000
11316011999365.307297-154.7306986/1/2015 12:00:00 AM0.00000001850.00328.26001063371015.20001233.18999917.20000185.199997
11316111999431.988600-91.5199978/3/2018 12:00:00 AM315.39999443180.31494.61999593702102.87988378.4599998.100000110.892326
11316211999531.988600-91.5199977/3/2018 12:00:00 AM0.0000000187.91296.39999492752840.30004982.90000217.20000199.199997
11316311999632.928001-84.4652023/28/2015 12:00:00 AM298.399994682520.00412.51998956371020.79998851.34999812.900000321.851410
11316411999732.928001-84.4652022/28/2015 12:00:00 AM0.00000002522.45303.58999658351032.90002460.48000011.40000076.699997
11316511999832.990601-91.68740110/2/2011 12:00:00 AM308.44999775350.00433.01998976441023.40002462.4500018.10000059.000000
11316611999932.990601-91.6874019/2/2011 12:00:00 AM0.0000000350.04413.10000698712320.86010762.07000012.70000055.000000